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Abstract. We prove that the notions of Krengel entropy and Poisson entropy 
for infinite-measure-preserving transformations do not always coincide: We 
construct a conservative infinite-measure-prescrving transformation with zero 
Krengel entropy (the induced transformation on a set of measure 1 is the Von 
Neumann-Kakutani odometer), but whose associated Poisson suspension has 
positive entropy. 



1. Introduction 

1.1. Entropy for infinite-measure-preserving transformations. There exist 
several notions of entropy for infinite transformations, which elegantly general- 
ize Kolmogorov's entropy of a probability-preserving transformation. Krengel [3] 
comes down to the finite-measure case by considering the entropy of the induced 
transformation on a set of finite measure: The Krengel entropy of a conservative 
measure-preserving transformation (X, B, fi,T) is defined as 

h Kl {X,B,fi,T):= sup n(A)h(A,BnA,fi A ,T A ), 

where F+ is the collection of sets in B with finite positive measure, ha is the 
normalized probability measure on A obtained by restricting /i to B D A, and Ta ■ 
A — > A is the induced map on A. Recall that this map is defined by 

T A (x) :=T r ^(x), 

where rA(x) := minjfc > 1 : T k (x) S A} is the first-return-time map associated to 
A. As soon as T is not purely periodic, Krengel proved that 

h Kl (X, B, h, T) = ft(A) h(A, BnA, fi A , T a ), 

where A is any finite-measure sweep-out set (i.e. a set such that {J n>0 T~' n A = X). 

The Parry entropy of an infinite-measure-preserving transformation T has been 
defined in [7] as the suprcmum of the conditional entropy of C with respect to T _1 C, 
for all CT-finite sub-tr-algebras C such that T~ 1 C C C. 

Recall now that to each infinite-measure-preserving transformation T we can 
associate a probability-preserving transformation T* called its Poisson suspension, 
and which can be described as follows (we refer to [8] for details): We consider a 
Poisson process on X with intensity /i, which we can consider as a random col- 
lection of particles. These particles are distributed over X in such a way that, 
denoting by Nb the random number of particles in any finite- measure set B, for 
any finite collection of pairwisc disjoint, finite- measure sets B\, . . . ,B n £ B, the 
random variables Nb 1 , ■ ■ ■ ,Ns n are independent, and follow Poisson distributions 
with respective parameters . . . , Then T* is defined on the canon- 

ical space of this Poisson process, and it consists in moving individually each of 
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Tower 1 Tower 2 



Figure 1. First steps in the construction of the Von Neumann- 
Kakutani odometer by cutting and stacking. 

these particle according to the transformation T on X. The Poisson entropy of 
an infinite-measure-preserving transformation was defined by Roy [8] as the Kol- 
mogorov entropy of its Poisson suspension. 

Relations between these notions of entropy are studied in [3J: On large classes 
of transformations {e.g. quasi-finite transformations, rank-one transformations), it 
is proved that Poisson entropy is equal to Krengel entropy and to Parry entropy. 
Moreover, in any case, Parry entropy is dominated by both Krengel and Poisson 
entropy. 

It was asked in [3| whether, for any conservative measure-preserving transforma- 
tion, these three definitions always coincide. The purpose of the present paper is 
to show that the answer is negative, by constructing a counterexample. 

Theorem 1.1. There exists a conservative infinite-measure-preserving transforma- 
tion with zero Krengel entropy (hence zero Parry entropy), but whose associated 
Poisson suspension has positive entropy. 

2. Construction 

2.1. Von Neumann-Kakutani odometer. The transformation T is constructed 
as a tower over the Von Neumann-Kakutani odometer S. Let us recall the con- 
struction of the latter by cutting and stacking (see Figure [J). We start with the 
interval A := [0, 1]. The first step consists in cutting A into two sub-intervals Ai 
and A \ A\ of measure 1/2, and stacking A\A\ over A\. We get a tower of height 
2 which we call Tower 1. After step n, A has been cut into 2" sub- intervals which 
are stacked to get Tower n. This means that at this step each point of A (except 
those lying on the top of the tower) is mapped by S to the point of A lying above 
it. We construct Tower n + 1 by cutting Tower n into two equal parts. We call 
A n+ \ the left half of the top interval of Tower n and we stack the right part of 
the tower over A n+ i, thus dividing by 2 the measure of the set where S is not yet 
defined. Repeating this procedure defines the Von Neumann-Kakutani odometer 
which preserves the Lcbcsgue measure on A. It is well known that the odometer S 
is ergodic and has zero entropy. 

2.2. Construction of T. T is constructed on K + in such a way that the induced 
transformation Ta coincides with the odometer previously defined. T is completely 
defined (up to isomorphism) by giving for each point x € A the first return time 
ta{x) to A. 

We fix an increasing sequence of integers (M n )„>o, with M n — > oo. For any 
n > 1, we choose a large enough integer k n (to be precised later). We define 
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the first return time to A so that its restriction to A n is uniformly distributed on 
{M„, M n + 1, . . . , M n + k n - 1} for any n. 

We will see in section 2] that by choosing k n large enough, the entropy of T* is 
positive. 



3. POISSON APPROXIMATION LEMMA 

The purpose of this section is to prove the key lemma. This lemma roughly states 
that when k n is large enough, it is almost impossible in the Poisson suspension to 
keep track individually of the particles when they leave A n if we only have access 
to the number of particles in A. For this, we compare two processes: The first one 
is simply an i.i.d. sequence of Poisson random variables, whereas the second one 
modelizcs particles leaving A n and coming back to A. The comparison between the 
two processes uses the notion of d-distance, of which we recall some properties. 

3.1. The d-distance. The d-distance between two stationary processes has been 
introduced by Ornstein for the proof of the isomorphism theorem of Bernoulli shifts. 
We refer to [6] or [9] for the properties of this distance which we use later and which 
we recall here. 

Let £ and £ be two stationary processes taking values in a countable alphabet 
B. For any integers p < q, we denote by £|£ the finite sequence (£ p , £ p +i, . . . , 

For L > 1, let Jl(£, C) be the set of all joinings of £|f and Cli , that is probability 
distributions on M L x B L whose marginals are the distributions of £|f and C,\i . We 
first define g?l for any L > 1, by 

efe&C) := 4 min E A k(e|f,C|f) 



where d^ is the Hamming distance between sequences of length L: 

L 



1 L 

d L (xi ...xl,z\...zl) ■= j ^ 1 



L 

l 

Then, the d-distance between £ and £ is defined by 

JfcC) :=sup4(£,C)- 

L>1 

It can be shown that d(£, £) is also the minimum of A (£o ^ Co) when A ranges over 
all stationary joinings of £ and £. 

The two key properties of the c?-distance that we shall use arc: On the one hand, 
the fact that entropy of processes close in d-distance can be compared (Lemma 13.11 
below). On the other hand, a practical tool to estimate the J-distance between 
processes using conditional distributions on the past: If, for all large enough n, 



(i) E | p ( e ° = & l ?l= ") " p ( Co = 6 I CI= ") 



< £ 



for all past outside a set of measure e and all past £|-n outside a set of measure 
e, then J(£, £) < 3e. Moreover, the same conclusion holds if we replace in (TTJ) the 
conditional distributions with respect to the past by conditional distributions with 
respect to finer cr-algebras. 
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Lemma 3.1. Let £ be a stationary process taking values in a countable alphabet 
B which has Unite entropy. For any e > 0, there exists 6 > such that any 



Proof. Without loss of generality we can assume B = Z + . For any integer £ > we 
define the process (£ A £) taking values in the finite alphabet {0, 1, ... ,£} by 



We choose £ large enough so that h (£ A £) > h(£) — e/2. Then we use the fact that 
entropy is a continuous function of processes taking values in a given finite alphabet, 
when these processes are topologized with the (i-distance (see e.g. [9] page 100). 
Therefore we can find 5 > such that any process taking values in {0, 1, . . .,£} at 
d-distancc at most S from (£ A £) has entropy at least — e. Now, if c?(£, £) < S, 
then d ((£ A £) , (C A £)) < 6 (where (£ A £) is defined from £ in a similar way), hence 



3.2. Comparison between connected and disconnected processes. Let A 

be a finite alphabet, P B and P w be two probability measures on A and Aaxa be 
a joining of P B and P w . Let 5 be some fixed positive real number. We define two 



The process £ is constructed from two independent sequences of i.i.d. random 
variables distributed according to the Poisson distribution of parameter 8/2, which 
can be interpreted as numbers of black and white particles lying on each site of Z. 
Then to each black (respectively white) particle we randomly and independently 
associate a label picked in A according to P B (respectively P w ). For any x S Z 
and any labels a,b € A, £j* (a) (respectively ^ (&)) is the total number of black 
(respectively white) particles labelled by a (respectively b) at position x. In other 
words, the process £ associates in an i.i.d. way to each site igZa finite sequence 
^gA of independent random variables respectively distributed 
according to the Poisson distribution of parameter P B (a)8/2 and P w (b)S/2. 

Let k and M be two integers. The process £ is also constructed from black and 
white particles on Z, but which are no longer independent. The number of black 
particles at each site is given by a sequence of i.i.d. random variables distributed 
according to the Poisson distribution of parameter 6/2. For each black particle at 
position x £ Z, we first pick a random integer j, uniformly in {M, M + 1, . . . , M + 
k — 1} and independently of all other particles. Then we link the black particle to 
a white particle that we put at position x + j. For each such couple of black and 
white particles, a couple of labels is picked in A x A with probability Aaxa^ The 
first label is associated to the black particle and the second label to the white one. 
Then, for any x € Z, C, x = (Cf (a), C^(^)))a,beA denote the number of black particle 
labelled by a and the number of white particles labelled by b at position x. 

Lemma 3.2. For any e > 0, if k is large enough, d(£, £) < e. 

Proof of Lemma \3.Sl simple case. We first prove the lemma in the case where A is 
reduced to a singleton. Since all particles have the same label, we just forget it and 
simply count the number of black and white particles on each site. 





MO ^ h((A£) > h{£)-e. 



□ 



processes £ and C on (N A ) . 
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We now prove that if k is large enough, for any n > M + k, 

j.eeft 



J 



< e, 



with probability 1 — e on Cl-n- Since is Poisson distributed with parameter 6/2, 
and independent from C|l n ; an d sm ce Co 1 ', is independent from ^q 8 conditionally to 
the past, it is enough to prove that if k is large enough, for any n > M + k, 



E 



P(C = t\C\z] l ) 



-5/2 



(6/2Y 



< s, 



with probability 1 — e on C|-n- 

In fact we rather condition with respect to an enriched past (see Figure | 
Assume that besides the number of black and white particles on each site x € 
{— n, . . . , —1}, we also know which the links are between them. 




Only non-free Here, free black particles All black particles 

black particles: may be linked to a are free, but have no 

no influence on white particle at influence on 



Figure 2. The enriched past. 



From what we know, we can distinguish two kinds of black particles between —n 
and — 1: Those which are linked to a white particle lying on the left of 0, and those, 
called free particles, whose white particle's position is unknown. Observe that only 
free particles may have some influence on . Hence, black particles lying on the 
left of site — (M + k — 1) have no influence on since they are not free. Black 
particles lying on the right of site —M are free but nevertheless have no influence 
on (ff since a black particle is linked to a white particle at distance at least M. 

So it remains to study the influence on of free black particles at sites between 
— (M + k — 1) and —M. For 1 < j < k let us denote by Fj the number of free 
particles at site — (M +k)+j. Any black particle at site —(M + k)+j has probability 
j/k to be free. Therefore, Fj follows the Poisson distribution with parameter 

Fix 1 < j < k. Assume there is a free particle at site — (M + k) + j. Since there 
are j possible positions for its white particle, the latter has probability 1/j to lie 
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at site 0. Hence, conditionally to our enriched past, the number of white particles 
at site can be written as 

k Fj 

EE s i= 

3=1 t=l 

where (Bj) arc independent Bernoulli random variables with respective parameter 
l/j. The law of such a sum of independent Bernoulli variables is close to a Poisson 
distribution of parameter 5/2 as soon as the sum of the parameters is close to 5/2 
and all parameters are small enough (see [2], Theorem 23.2 page 312). Therefore 
we can choose a large enough integer J, and e± small enough so that, if Z is a sum 
of independent Bernoulli random variables, each with parameter less than 1/ J and 
such that the sum of parameters is within E\ of 5/2, then 



E 



\Z = £) -exp(-,5/2)- (<5/2)< 



< e. 



We have now to avoid bad configurations, that is configurations of the enriched 
past which have free particles close to — (M + k — 1) giving rise to Bernoulli with 
large parameters, and configurations such that the sum of the parameters is not 
close enough to 5/2. 

Control of the parameters' size. We compute the probability that no free particle 
lie between —{M + k) + 1 and — (M + fc) + J: 

^ = o,, = v..,.) = n»p(-|)=e XP (-^ 

Under this condition, ^ is (conditionnally to the enriched past) the sum of inde- 
pendent Bernoulli variables with parameters smaller than 1/J. If fc is large enough, 
this happens with probability larger than 1 — e/2. 

Control of the parameters' sum. Since the Fj are independent and Poisson dis- 
tributed with parameter ||, the expected value of the sum S := 2j=i i ^^j °^ ^ ne 
parameters is 5/2, and its variance is 

1 Sj 5 



^ 7 2 2fc 2fc ^ j ' 

= 1 J .7 = 1 J 



3 = 1 3 = 

Hence, if fc is large enough, 

P{\S-5/2\ < si) > 1 -e/2. 
Putting things together, we have proved that with probability larger than 1—e on 

50 



the enriched past, the conditional distribution of the number (^ of white particles 



at site satisfies 



E 



P((W = £|enriched past) - exp(-<V2)- ^ 



< E. 



£>0 

This proves Lemma 13.21 when A is reduced to a singleton. □ 
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Proof of Lemma \3.'A general case. We consider the family of independent processes 
Qa,b = ^B, a ,b } (W.a.b^ (^ fr) e a x A, which counts the number of black and white 
£ particles at x belonging to a pair of black and white particles respectively labelled 
by a and b. Then £ a,b is a simple-case £ process, for which the expected number of 
black particles per site is <5Aaxa(«, b)/2. From the proof in the simple case, we know 
that as soon as k is large enough, the d-distance between £ a,b and is smaller 
than e/|A| 2 , where £ a,b = (£ B > a > b f £ w > a > b '\ is composed of two i.i.d. sequences of 
Poisson random variables of parameter 5Aaxa(ci, b)/2. We can recover C^(a) and 
Cf (b) by 

C?(a)=£C?" b and Cf (6) = £ & 

6GA aGA 

On the other hand, ^2 b€A £B,a,b (respectively X^ga £, W ' a ' b ) has the same distri- 
bution as £ B (a) (respectively £ w (b)): It is an i.i.d. sequence of Poisson random 
variables of parameter P B (a)S/2 (respectively P w (b)S/2). Summing over a and b, 
it follows that d(C,£) < £• □ 

4. Positive Poisson entropy 

We denote by the stationary process living in the Poisson suspension of our 
transformation T, defined by 

Q. := number of particles in A at time x. 

The purpose of this section is to show that the entropy of the process is positive 
as soon as the fc ra 's are chosen large enough. This will be proved by showing that 
the J-distance between and an i.i.d. sequence of random Poisson variables 
with parameter 1 can be made as small as we want. By Lemma 13.11 this will be 
enough to conclude. 

Our strategy is the following: As one goes along in the construction of the return 
time to A, we define a sequence (TW) of infinite -measure-preserving transforma- 
tions, approximating the final transformation T. Then we consider the process £( n > 
living in the Poisson suspension 

of t(") : 

:= number of particles in A at time x for the Poisson suspension over 

The transformation T' ) is constructed by stacking infinitely many pairwise dis- 
joints intervals of length 1, A being one of them, into a doubly infinite tower, each 
interval being mapped onto the one just above. Therefore, as mentioned previ- 
ously, £ (0) is an i.i.d. sequence of Poisson variables with parameter 1. At step n 
of the construction, we define the return time to A on the subset A n . This return 
time to A will be the same for all transformations T^ m \ m > n, and for the final 
transformation T. By choosing the return time adequately, we will make sure that 

j(n-l)^(n) N 



so that for all 



n 



(2) d(e (0) ,c 



(») 



< e. 



Let us describe the first step. Recall that Tower 1 is of height 2, with basis A\ 
(See Figured]). We cut A\ into k\ equal subintervals, and we define on A\ the 
return time to A to be M\ +j — I on the j-th subinterval. We insert Tower 1 into a 
doubly infinite tower of intervals of length 1/2 and add spacers between A\ and its 
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A\(A 1 UA 2 ) 



A 




Ai 



Step Step 1 Step 2 

Figure 3. First steps in the construction of the return time to A. 

image by the odometer S: We insert Mi + j — 2 spacers of width l/(2fci) between 
the j-th subinterval of Ai and its image by S. The transformation is defined 
by mapping each point to the point right above it. 

The process counts the number of particles in A at time x for the Poisson 
suspension over T^ 1 ': We interpret points in A\ as black particles and points in 
A \ Ai as white particles. For the suspension over T^-°\ black and white particles 
are independent, whereas for the suspension over they are linked through 

the return time to A of the point corresponding to the black particle. Hence, a 
direct application of Lemma 13.21 in the simple case with no alphabet gives that 
< e/2 when ki is large enough. 

Suppose the return time to A has already been defined on all A^, 1 < i < n — 1. 
Consider Tower n. The return time to A has already been defined on all rungs but 
the roof and A n (which is the rung of level 2' 1 ~ 1 ). For each point y in A n , let h B (y) 
be the sequence of the return times to A when we climb the first half of Tower n 
before reaching y, and let h w (y) be the sequence of the return times to A when we 
climb the second half of Tower n starting from Sy. We denote by r(y) the return 
time to A, which is to be defined at this step. We want r(y) to be independent 
of h B (y) and h w {y). To this end, we consider the finite partition of A n generated 
by h B and h . Each atom of this partition is cut into k n equal pieces, and we 
define r to be M n + j — 1 on the j-th piece of each atom. Here is how we define 
the transformation T^> : we insert Tower n into a doubly infinite tower of intervals 
of length 1/2™ and insert as many spacers as we need between the rungs of Tower 
n to achieve the already defined return time to A. The transformation TW maps 
each point to the point right above it. 

Let us turn to the estimation of d(^ n ~ 1 ',^'"^) for n > 2. We want to apply 
Lemma 13.21 Black particles are points in A n and white particles are points in 
SA n , Let Rn-i be the maximum value of the already defined return time to A on 
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A\ U • • ■ U A n _i. We consider the finite alphabet A := {1, • • ■ , . To 

each point y in A n , we associate the label h B (y) € A of the return times to A when 
we climb the first half of Tower n before reaching y. To each point Sy in SA n , we 
attach the label h w (y) £ A. which is the sequence of the return times to A when we 
climb the second half of Tower n starting from Sy. Let the process £ (respectively 
£) count the numbers of black and white particles together with their label in the 
suspension over 

^("-1) (respectively T^). These processes are exactly of the form 
studied in Lemma 13.21 

Now, observe that we can recover Z^-V and from £ and (: 

^ n_1) = E £o B w + € (&) + - + ^ 1+ ... +fe2 „_ 1 _ 1 

+ E tfw + z\ w + • • • + ^W-.+w.j w, 

and 

= E Co s w + c£ w + • • • + cf 1+ ... +/l2 „_ 1 _ 1 (ft) 

+ E Co w (ft) + c\ w + • • • + c^ 1+ ...+h 2 „_ 1 _ l) (ft)- 

heA 

It follows that, if £ and C coincide on {-i^-i^" -1 - 1), ... , R n -x(2 n - 1 - 1)}, then 
£o" 1 ' > = fo • By Lemma [3.21 d(£, C) can be made arbitrarily small by choosing 
k n large enough. Hence we can assure that d(^ n— 1 ?C ) < 2 _Tl e. 

Finally, note that since M n is increasing, the return time to A on the roof of 
Tower n (the union of Ai, i > n) will be larger than M n+ \. Hence, for any L > 0, 

if n is large enough so that M n+ i > L, the distribution of 1 coincides with 

the distribution of ^°°)|^ 1 . Therefore, 

d L (e (oo) ,C (0) ) = d L (CW,C (0) ) < d(&\e ] ) < e t 

which implies 

5. Comments and open questions 

In view of previously known results on the subject, some comments on the 
infinite-measure-preserving transformation T constructed in Section[5]may be made. 

First, although its construction is derived from the standard cutting-and-stacking 
procedure used to build the most elementary rank-one system (the Von Neumann- 
Kakutani odometer), the transformation T is not even of finite rank. Indeed, Propo- 
sition 10.1 in [3J shows that for finite-rank systems, both the Poisson and Krengcl 
entropy vanish. 

Second, it was also proved in [3J that Krengel and Poisson entropies coincide 
for quasi- finite transformations, namely transformations for which there exists a 
sweep-out set A of measure 1 such that the return-time partition of A has finite 
entropy. There exist only few examples of transformation for which the non quasi- 
finiteness has been established: an unpublished example constructed by Ornstein 
has been mentioned by Krengel in [5] , and the only published example which we are 
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aware of is a rank-one system, given by Aaronson and Park in [T] . Our construction 
thus provides a new example of a non quasi-finite transformation. 

After having proved that the different notions of entropy for infinite-measure- 
preserving transformations do not always coincide, a natural question is to ask 
whether they are always ordered in the same way: Is it true that Poisson entropy 
always dominates Krengel entropy? Can we at least decide whether zero Poisson 
entropy implies zero Krengel entropy? And what about similar questions regarding 
the comparison between Parry entropy and Poisson entropy? It may be worth 
recalling here that the equality of Parry entropy and Krengel entropy in the quasi- 
finite case was proved by Parry in 1969 [7], but that the question whether they 
always coincide is, as far as we know, still open. 

Acknowledgements. The construction of the transformation T in Section [2] has 
been inspired by a private communication of Benjamin Weiss concerning the Shannon- 
McMillan-Breiman theorem. 

We are also much indebted to Emmanuel Roy for stimulating conversations on 
the subject. 
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